1. INTRODUCTION

Distribution is one of the most important activities in industry because it delivers products from
producers to consumers [1]. Without a systematic distribution channel, even the best products and
promotional strategies will not make a product known to and consumed by the end user [2]. The goal of a
systematic distribution channel is to deliver products to customers on time while reducing the industry's
operational costs. Toth and Vigo [3] report that, in Europe and North America, using computerized
procedures for distribution planning can reduce transportation costs by 5-20%. Mathematical programming
therefore makes route determination for goods distribution more effective. In mathematical terms, the
problem of determining distribution routes is the Vehicle Routing Problem (VRP). VRP is a combinatorial
problem concerning the distribution of goods from a depot (distribution center) to customers scattered
across various locations, subject to a limited number of vehicles, the distances between depot and
customers, vehicle capacity, and time [4].

The VRP was first proposed in 1959 by Dantzig [5], and the range of VRP problems continues to grow.
VRP with Time Window (VRPTW) is an important problem in logistics and transportation systems because it
affects company profit. Zhong and Pan [6] solved the VRPTW with the aim of minimizing cost, but the results
were less than optimal because the resulting cost did not match real conditions. Zhou and Wang [7]
describe the VRPTW as a set of customers served by one vehicle from the main depot, with each vehicle
passing its route only once, subject to a limited payload and to the times at which each customer is
available.

Because each route is passed only once, customers need to be grouped by region so that many customers
can be served and customer satisfaction maintained. According to MacQueen [8], k-means is a clustering
method that can give maximum results. Setting the right center point for each cluster provides a precise
grouping of the data [9]. K-means is a good clustering method that reaches a solution quickly. Clustering
has also been used to solve the missing value problem [10]. On this basis, k-means is a suitable method
for clustering priority areas.

This study is based on observations of a home industry that produces tempe chips. The home industry
has a wide marketing area but a limited number of vehicles with which to meet customer demand. Deliveries
sometimes miss the time set by the customer, which results in many complaints. The wide marketing area,
limited vehicles, and limited time can be addressed through strategic planning: planning the routes that
the distribution process follows, the load carried by each vehicle, and the priority areas that receive
orders first. Such planning can be computer-assisted using an artificial intelligence method such as the
K-Means method [8], [9], [11] to categorize priority areas. To address these problems, this study proposes
an area mapping method that uses K-Means, combined with a Genetic Algorithm for route optimization, to
achieve faster computation than the conventional method currently used in the home industry.

The Genetic Algorithm can solve route optimization in VRP problems [12] with fast computing time [4].
Mnasri, et al [13] used a genetic algorithm to optimize routes in the VRPTW by improving the crossover
process, obtaining the best route at minimum cost. Lesmawati [14] used a genetic algorithm for the
distribution of frozen food, and Philip, et al [15] used a genetic algorithm for distribution routing.
These studies show that the genetic algorithm can obtain the optimal route with fast computing time and
is suitable for multi-objective optimization problems [16]. In addition, genetic algorithms can find the
best solutions in wide and complex search spaces [17]. The focus of this research is to solve the complex
VRPTW problem by combining the k-means method and the genetic algorithm for tempe chips distribution in
Malang. K-means clusters the priority areas to be visited by the vehicles, and the genetic algorithm
schedules the vehicles that visit the customers.


2. RELATED WORK 

Various improvements to the genetic algorithm have been made to solve the VRPTW problem. Mnasri, et
al [13] modified the chromosome initialization process, generating chromosomes by Solomon's insertion,
random interchange, and random ordering. Chunyu and Xiaobo used a cross-order operator and a partial route
reversal operator to increase convergence speed [18]. Yucenur and Demirel [19] performed clustering using
nearest neighbor together with a genetic algorithm to solve the VRP; merging the two methods reduced
computing time. Cheng, et al [20] combined k-means and GA to create adaptive clusters aimed at reducing
time and space complexity. Their results show that the method is feasible and effective for cluster
analysis. Zhaoxia and Hui [21] used a genetic algorithm to determine the initial cluster centers in order
to obtain maximum results, and showed that the method is suitable for both small and more complex data
groups. Krishna and Murty [22] combined a genetic algorithm and k-means to address the problem of global
search: the k-means method is used to limit the search of the crossover operator, producing a method named
GKA (Genetic K-Means Algorithm), which includes a k-means operator among the genetic algorithm operators.
In contrast, this study first uses k-means clustering to classify the areas, and the k-means result is
then used as input to the genetic algorithm for route optimization.


3. RESEARCH METHOD 
The data used in this study come from a tempe chips home industry, "XYZ", in Malang. Distance data
were measured using Google Maps. The data are provided in the attachment.


3.1. K-Means 

For the clustering process, k-means is a method that works simply and gives good results [8]. The
first step is to initialize k, the number of clusters, and then determine the center point of each
cluster, which is usually done randomly. In this study, the center points are set manually. Each object is
then assigned to a particular cluster, so that objects in the same cluster are similar. The similarity
between an object and the center point of a cluster is measured by the Euclidean distance (D) shown in
Equation 1. An object is assigned to the cluster with the smallest D value.





$$D(x_2, x_1) = \sqrt{\sum_{j=1}^{p} (x_{2j} - x_{1j})^2} \qquad (1)$$

where:

p = dimension of the data

x_1 = position of point 1
x_2 = position of point 2


The flow of the k-means clustering algorithm is as follows [9]:

1. Initialize the number of clusters k and the centroid of each cluster.
2. Assign each object to the cluster with the closest centroid, based on the Euclidean distance.

3. Recalculate the centroid of each cluster with Equation 2.


$$V_j = \frac{1}{N_j} \sum_{p=1}^{N_j} data_p \qquad (2)$$


V_j is the centroid of cluster j, N_j is the number of objects in cluster j, and data_p is the p-th
data vector in cluster j.
4. Repeat steps 2 and 3 until the centroid values no longer change or a specified number of iterations
is exceeded.
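
The four steps above can be sketched in Python as follows. This is a minimal illustration rather than the
implementation used in the study: the function names, the 2-D store coordinates, and the starting
centroids are hypothetical, and an empty cluster simply keeps its previous centroid.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two points of equal dimension (Equation 1)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def k_means(points, centroids, max_iter=100):
    """Cluster points around manually chosen initial centroids.

    Returns (labels, centroids); labels[i] is the cluster index of points[i].
    """
    centroids = [tuple(c) for c in centroids]
    labels = []
    for _ in range(max_iter):
        # Step 2: assign each object to the cluster with the nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda j: euclidean(p, centroids[j]))
            for p in points
        ]
        # Step 3: recompute each centroid as the mean of its members (Equation 2).
        new_centroids = []
        for j, old in enumerate(centroids):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members))
                )
            else:
                new_centroids.append(old)  # assumption: empty cluster keeps its centroid
        # Step 4: stop when the centroids no longer change.
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return labels, centroids

# Hypothetical store coordinates and hand-picked starting centroids.
stores = [(1.0, 1.0), (1.5, 2.0), (8.0, 8.0), (9.0, 9.5), (1.0, 0.5), (8.5, 9.0)]
labels, centers = k_means(stores, centroids=[(0.0, 0.0), (10.0, 10.0)])
```

Passing the initial centroids explicitly mirrors the manual setting of center points described in this
section; a random initialization would only change how the `centroids` argument is produced.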


3.2. Genetic Algorithm 

The genetic algorithm is a stochastic optimization technique based on Darwin's theory of evolution
[23]. The process of the genetic algorithm comprises chromosome representation, population formation,
fitness calculation, reproduction through crossover and mutation, and a selection process to obtain new
individuals.
1) Chromosome Representation 

To explain the chromosome representation, a simple example is given. There are 6 customers (P11,
P12, P21, P22, P31, P32) spread across 3 distinct regions, and 3 pickups (V1, V2, V3) with a payload of
625 tempe chips serve the customers.

The chromosome uses a permutation representation with 2 segments. The first segment represents the
customers in the different regions and the second segment represents the vehicles used for the
distribution process. Figure 1 illustrates the chromosome representation.
Figure 1. Representation Chromosomes


In Figure 1, the blue colour indicates the customers in each region and the yellow colour marks the
three vehicle genes: gene 1 in column V1 serves the customers in regions P11 and P12, gene 2 in column V2
serves the customers in regions P21 and P22, and gene 3 in column V3 serves P31 and P32. There are 50
destination stores and 3 pickups, so each vehicle gets 16 destinations, with 6 working days assumed. The
time needed to drop off goods is 30 minutes. If service passes the deadline, a penalty is counted.
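
The two-segment representation can be illustrated with a short Python sketch. It assumes, since the
figure itself is not reproduced here, that the customer segment is split into equal consecutive groups
and that the i-th group is served by the i-th vehicle gene; the `decode` function and its argument names
are hypothetical.

```python
def decode(chromosome, n_customers, n_vehicles):
    """Split a two-segment chromosome into {vehicle: [customers in visiting order]}."""
    customers = chromosome[:n_customers]   # segment 1: permutation of customers
    vehicles = chromosome[n_customers:]    # segment 2: vehicle genes
    if len(vehicles) != n_vehicles:
        raise ValueError("segment 2 must contain one gene per vehicle")
    per_vehicle = n_customers // n_vehicles  # assumption: equal share per vehicle
    routes = {}
    for i, vehicle in enumerate(vehicles):
        routes[vehicle] = customers[i * per_vehicle:(i + 1) * per_vehicle]
    return routes

# The 6-customer, 3-vehicle example from the text.
chromosome = ["P11", "P12", "P21", "P22", "P31", "P32", "V1", "V2", "V3"]
routes = decode(chromosome, n_customers=6, n_vehicles=3)
```

With this chromosome, `routes` maps V1 to P11 and P12, V2 to P21 and P22, and V3 to P31 and P32,
matching the assignment described for Figure 1.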

2) Fitness Calculation 

After the chromosomes have been generated, the fitness value is calculated. This value shows the
ability of an individual to survive and continue to the next process [23]. The fitness function is shown
in Equation 3.


$$\mathit{fitness} = \frac{1}{1 + tt} + (tp \times (-1)) \qquad (3)$$


tt is the total travel time and tp is the total penalty for violations. The penalty is calculated as the
sum of the waiting time and the overtime in dropping off products, based on the time window.
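
As a hedged illustration of Equation 3, the sketch below evaluates one route. The route format, the
`evaluate` and `fitness` names, the sample numbers, and the rule that a vehicle arriving early waits until
the window opens are assumptions made for this example; only the 30-minute drop-off time and the form of
the fitness function come from the text.

```python
SERVICE_MINUTES = 30  # drop-off duration stated in the text

def evaluate(route, start=0):
    """Return (tt, tp): total travel time and total time-window penalty, in minutes.

    route: list of (travel_minutes_from_previous_stop, window_open, window_close).
    """
    clock, tt, tp = start, 0, 0
    for travel, window_open, window_close in route:
        clock += travel
        tt += travel
        if clock < window_open:       # early arrival: count the waiting time
            tp += window_open - clock
            clock = window_open
        clock += SERVICE_MINUTES
        if clock > window_close:      # finished after the deadline: count the overtime
            tp += clock - window_close
    return tt, tp

def fitness(tt, tp):
    """Equation 3: fitness = 1 / (1 + tt) + tp * (-1)."""
    return 1.0 / (1.0 + tt) + tp * (-1)

# Hypothetical route with three stops.
route = [(20, 0, 60), (15, 100, 160), (40, 150, 180)]
tt, tp = evaluate(route)
score = fitness(tt, tp)
```

For this route the vehicle waits 35 minutes at the second stop and finishes 20 minutes late at the third,
so tt is 75 and tp is 55.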
3) Crossover Process 

Crossover is one method of reproduction for producing new individuals (offspring). The method used is
one-cut-point crossover. The number of new individuals is obtained by multiplying the population size (pop
size) by the specified crossover rate (cr). In this example cr is 0.6 and pop size is 5, so 0.6 x 5 = 3
offspring are produced. Two individuals are then picked at random as parents for the reproduction process.
Here the selected individuals are P1 and P3. Figure 2 shows the selected individuals.
Figure 2. Crossover process


After the parents are selected, the next step is to choose a random cut point for the crossover
process. Figure 3 shows the crossover process.
Figure 3. Offspring Crossover Result


In segment 1, the offspring takes 2 genes, P31 and P32, from parent 1; the remainder is taken from the
genes of parent 2 that are not yet in the offspring. In segment 2, two genes are selected, so parent 1 and
parent 2 exchange only the first and last genes while the central gene remains.
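
A minimal Python sketch of one-cut-point crossover on the customer segment is given here. It covers
segment 1 only, and the parent orderings, function names, and the `rng` parameter are hypothetical; the
offspring count follows the cr x pop size rule described above.

```python
import random

def one_cut_point(parent1, parent2, cut):
    """Keep parent1[:cut], then append parent2's genes not yet present, in order."""
    head = parent1[:cut]
    tail = [gene for gene in parent2 if gene not in head]
    return head + tail

def make_offspring(population, cr, rng=random):
    """Create round(cr * pop_size) children from randomly chosen parent pairs."""
    n_children = round(cr * len(population))
    children = []
    for _ in range(n_children):
        p1, p2 = rng.sample(population, 2)   # two distinct parents
        cut = rng.randint(1, len(p1) - 1)    # random cut point inside the segment
        children.append(one_cut_point(p1, p2, cut))
    return children

# Hypothetical parents: the child keeps P31 and P32 from parent 1.
parent1 = ["P31", "P32", "P11", "P12", "P21", "P22"]
parent2 = ["P11", "P21", "P31", "P12", "P32", "P22"]
child = one_cut_point(parent1, parent2, cut=2)
```

Filling the tail from the second parent while skipping genes already present keeps the child a valid
permutation, so no customer is visited twice or omitted.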
4) Mutation Process

Mutation in the genetic algorithm is optional, depending on the problem being solved. In this study,
the mutation method used is exchange mutation, which works by randomly selecting 2 genes and exchanging
them. Only one randomly selected parent is used in the mutation process. For example, parent 5 is
selected. Figure 4 shows the mutation process.


P5   1 3 6 2 | 1 2 3
C4   1 4 3 6 2 5 | 1 2 3


Figure 4. Offspring Mutation Result 
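
Exchange mutation can be sketched as follows. The sketch assumes that only the customer segment is
mutated, since the vehicle genes are unchanged in Figure 4; the parent values and the function name are
hypothetical.

```python
import random

def exchange_mutation(parent, n_customers, rng=random):
    """Swap two randomly chosen genes of the customer segment; the parent is not modified."""
    child = list(parent)
    i, j = rng.sample(range(n_customers), 2)   # two distinct positions in segment 1
    child[i], child[j] = child[j], child[i]
    return child

# Hypothetical parent: six customer genes followed by three vehicle genes.
parent = [1, 5, 3, 6, 2, 4, 1, 2, 3]
child = exchange_mutation(parent, n_customers=6, rng=random.Random(0))
```

Because the two positions are distinct and the customer genes are unique, the child differs from its
parent in exactly two positions.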

5) Selection Process

The selection process that forms the new population uses replacement selection, because Anggodo, et
al found that replacement selection gives maximum results for the multi-trip VRPTW problem [17].
Replacement selection works by replacing chromosomes: an offspring chromosome replaces its parent
chromosome when it has a higher fitness value.
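
The replacement rule can be sketched in a few lines of Python. The pairing of each child with the index
of the parent it may replace is an assumption of this example, as is the toy fitness function; the text
only states that an offspring replaces its parent when its fitness is higher.

```python
def replacement_selection(population, offspring, fitness):
    """Return a new population in which a child replaces its parent only if it is fitter.

    offspring: list of (parent_index, child) pairs.
    """
    new_population = list(population)
    for parent_index, child in offspring:
        if fitness(child) > fitness(new_population[parent_index]):
            new_population[parent_index] = child
    return new_population

# Toy example: individuals are numbers and the fitness is the number itself.
population = [3, 5, 1]
offspring = [(0, 4), (1, 2)]
survivors = replacement_selection(population, offspring, fitness=lambda x: x)
```

Here the child 4 replaces the parent 3, while the child 2 is discarded because its parent 5 is fitter, so
the population size stays constant.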


4. EXPERIMENTAL RESULT AND DISCUSSION 

The genetic algorithm for the VRPTW of tempe chips distribution was evaluated with 3 tests: a
population size test, a generation test, and a test of the combination of crossover rate (cr) and mutation
rate (mr). Each parameter setting was tested five times. Because the genetic algorithm is a stochastic
method that produces a different solution on each run, the fitness of each test is calculated as the
average fitness value [24].
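
The averaging procedure can be sketched as below. The `run_ga` callable stands in for one complete run of
the genetic algorithm and is hypothetical, as are the seeding scheme and the toy stand-in used to
exercise it.

```python
import random
from statistics import mean

def average_fitness(run_ga, n_runs=5, seed=0):
    """Run a stochastic optimiser n_runs times and average the fitness each run returns."""
    results = []
    for k in range(n_runs):
        rng = random.Random(seed + k)   # distinct, reproducible seed per run
        results.append(run_ga(rng))
    return mean(results)

def toy_ga(rng):
    """Hypothetical stand-in for a full GA run: best of ten random fitness values."""
    return max(rng.random() for _ in range(10))

avg = average_fitness(toy_ga, n_runs=5)
```

Seeding each run separately keeps the runs different from one another while making the reported average
repeatable.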
1) Testing of Population Size 

This test uses population sizes in multiples of 100, from 100 to 2000, with cr and mr set to 0.6 and
0.4 based on the research of Lesmawati, et al [14]. The test results are shown in Figure 5.
Figure 5. Population Testing


Based on the population test, the highest fitness value, 2.34728e+12, is obtained with a population
of 2000. Testing was stopped at a population of 2000 because the fitness value is unbounded and may not
converge, and because of the long computation time.
2) Testing of Generation/Iteration 

The generation (iteration) test was performed using the best result from the population test, a
population of 2000, with the number of generations in multiples of 100. Details are shown in Figure 6.
Figure 6. Iteration test multiple 2


Figure 6 shows that the best fitness value, 1.22059e+11, is reached at 1600 iterations; after that the
algorithm converges and the resulting fitness values are lower.
3) Testing of combination cr and mr 

The combination of cr and mr is tested over the range 0-1. The test uses the best population size,
2000, and the best number of iterations, 1600. Figure 7 shows the results of the cr and mr combination
test.
Figure 7. Combination cr and mr


Figure 7 shows that the combination of cr and mr values determines the ability of the genetic
algorithm to explore and exploit the search space. The best cr and mr values differ for each problem. In
this study, the combination of cr and mr reaches the highest fitness value at cr 1.0 and mr 0.0, with a
value of 2.96851e+12. The test results show that fitness tends to worsen as cr decreases and mr increases.
A large solution space requires a search operation focused on global search. The crossover process tends
to reach distant regions of the search space, while the mutation process reaches only near neighbors,
because it exchanges only one gene in one individual. A high crossover rate therefore tends to give better
values than a high mutation rate, although at mr 1.0 the fitness does not decrease but is better than at
the previous value, mr 0.9.

After the parameter tests, the best parameters were used to compare the Genetic Algorithm with the
Hybrid K-Means Genetic Algorithm. Each program was run 10 times and the average fitness value was taken.
Table 1 shows the comparison results.


Table 1. Result of Comparison

Method                                  Fitness       Computation Time
Genetic Algorithm                       8.07E+5       3900 seconds
Hybrid K-Means and Genetic Algorithm    3.0312E+12    2700 seconds


The fitness values in Table 1 show that the Hybrid K-Means Genetic Algorithm is a suitable
combination with a relatively short computation time. The proposed method gives better results because the
travel routes are formed after each region has been classified, so a vehicle's route does not leave its
assigned area. The chromosomes formed in the genetic algorithm are also shorter, so computation is faster.
In addition, the genetic algorithm alone often incurs penalties because its routes are long and
distribution runs only from 6 am until 12 pm, which makes its fitness far lower than that of the k-means
genetic algorithm.


5. CONCLUSION 

In this paper, k-means and genetic algorithms were tested. Based on the three tests performed
(population, iteration, and combination of cr and mr), it can be concluded that a population of 2000, 1600
iterations, and a combination of cr 1.0 and mr 0.0 produce the best fitness value with fast computation
time.

The combination of the k-means method and the genetic algorithm is a suitable combination. K-means
clusters the areas appropriately, and the genetic algorithm optimizes the routes within each cluster and
the relations between clusters. In terms of computation time, it is better than having the genetic
algorithm optimize the routes over all points. To improve the results further, the method could be
combined with route optimization that focuses on local search and explores solutions more deeply, for
example by applying an auto-speed acceleration algorithm to replace the crossover process, which is not
optimal [25], [26]. It is also possible to use additional parameters, e.g. a spatial data warehouse or a
geographic information system [26]. The approach in this paper can be applied directly to complex goods
distribution problems, resulting in lower cost and more efficient distribution time.